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DETAILED ACTION 

1 . This action is responsive to communications: The RCE filed 1 1/01/07. 

2. The rejection of claims 1-20 under 35 U.S.C. 1 12, first paragraph, as failing to comply 
with the written description requirement has been withdrawn as necessitated by Amendment. 

3. Claim 21 has been added as necessitated by Amendment. 

4. All previous rejections to the claims have been withdrawn as necessitated by 
Amendment. 

5. Claims 1-21 are pending in the case. Claims 1, 7, and 14 are independent claims. 

Claim Rejections - 35 USC §103 

6. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

7. Claims 1-2, 4-5, 7-8, 11-16, and 18-21 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Doerre et al (US-6,446,061 09/03/02) in view of Weiser et al (US-5,982,507 
1 1/09/99) in view of in further view of Chakrabarti et al (US-6,418,433 07/09/02). 

-In regards to independent claims 1, 7, and 14, Dorre et al teaches a computer 
implemented method comprising a processor and memory connected to said processor, wherein 
the method further comprises: 
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recognizing two or more concepts in a document wherein said two or more concepts each 
represent a basic idea expressed in said document format (column 5, lines 57-65: "extracting for 
each of said unprocessed documents its features... in a feature vector"); 

recognizing a concept association for the two or more recognized concepts associated 
with a conceptual model that includes the concept association for the two or more recognized 
concepts (column 4, lines 65-67; column 6, lines 1-5; column 12, lines 17-67; column 13, lines 
1-59: i.e. different concepts associated via the hierarchical clustered taxonomy); 

indicating a concept type associated with said document using the conceptual model 
(column 5, lines 55-65: "unprocessed document with said category-scheme of said cluster": i.e. 
the document is associated with a cluster with the most similar feature-vector; column 14, lines 

25- 65: "categorization tool assigns documents to predefined categories"); , wherein the concept 
type comprises a group of one or more concepts that represent a similar idea (column 2, lines 25- 
45: "cluster a generalized title or cluster label... group documents by subject"; column 14, lines 

26- 35: "assign documents to preexisting categories, sometimes called topics or themes"). 

Doerre further teaches wherein the document could come from a multitude of documents 
(column 4, lines 30-32). Dorre does not specifically teach wherein the initial document format 
had to be converted to one of the common document format to be processed. Weiser et al teach 
converting a document format (email message) from an email format to a common generic 
format (column 12, lines 53-55). It would have been obvious to one of ordinary skill in the art at 
the time of the invention for Dorre to have converted a document's initial format to a common 
document format, because Weiser et al taught that by doing so the common format can be 
understandable by the document system (column 12, lines 44-56: i.e. converting a document to a 
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format able to be processed by the a specific system provides the obvious advantage of being 
able to process the document in that system). 

Doerre further teaches wherein recognizing and categorizing documents was well known 
to enhance further document searching and retrieval (column 8, lines 37-40: "powerful and 
flexible queries"; column 9, lines 1-3). Doerre does not specifically teach receiving a search 
query associated with said concept type and identifying based at least in part on the association 
of the concept type with said document, that said document is responsive to said search query. 
Chakrabarti et al teach wherein a search query was associated with said concept type 
identification (column 2, lines 23-28 & 58-60: "database of Web pages that is focused on a 
predefined topic or topics"; column 3, lines 52-57: "focused database... receiving a search 
query"; column 5, lines 13-27); identifying said concept at least in part by using said concept 
identification of said search query (column 2, lines 58-60: "generate a database of Web pages 
that is focused on a predefined topic or topics"; column 5, lines 21-25: "a user can search the 
database 30 efficiently for Web pages of interest, i.e., only for Web pages relating to the topic of 
which the database 30 was focuses"); utilizing the conceptual model (column 4, lines 61-66; 
column 5, lines 13-27) to determine that said document was associated with said identified 
concept (column 2, lines 58-60; column 3, lines 52-57: column 5, lines 13-27); and concluding at 
least in part on the determination that said document was associated with said identified concept, 
that said document was responsive to said search query (column 3, lines 52-57; column 4, lines 
61-66; column 5, lines 13-27). It would have been obvious to one of ordinary skill in the art at 
the time of the invention for Dorre to have received a search query associated with said concept 
type for identifying said concept at least in part by using said identification, because Chakrabarti 
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et al teach that utilizing models (Fig. 1 : 35B & 35B) to associated documents with a predefined 
topic or topics (i.e. concepts) allows efficient searching of said topics by users (column 2, lines 
58-60; column 3, lines 52-57). 

-In regard to dependent claim 2, Doerre teaches for each of said two or more concepts, 
identifying a plurality of features in said document format, wherein said plurality of features 
represent evidence of said of one or said two or more concepts in said format (column 5, lines 5- 
65: "extracting for each. ..unprocessed documents. ..features and computing its feature statistics in 
a feature-vector"). 

-In regard to dependent claims 4 and 19, Doerre teaches based on conceptual model, 
generating an auto-attribute, said auto-attribute being a descriptive label for said document 
(column 2, lines 25-45: "cluster a generalized title or cluster label... group documents by 
subject"; column 14, lines 26-35: "assign documents to preexisting categories, sometimes called 
topics or themes"). 

-In regard to dependent claims 5, 18, and 20, Doerre teaches based on said conceptual 
model assigning said common format document to a subject category in a directory (column 13, 
lines 1-24; column 14, lines 26-65). 

-In regard to dependent claim 8, Dorre teaches wherein said conceptual model includes 
a concept dictionary (column 14, lines 57-65: "category scheme is a dictionary"). 
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-In regard to dependent claim 11, Doerre teaches wherein the conceptual model 
includes a noise dictionary (column 6, lines 37-47). 

-In regard to dependent claim 12, Doerre teaches assigning a subject category to said 
document based at least in part upon said conceptual model (column 12, lines 61-67; column 13, 
lines 1-54). 

-In regard to dependent claim 13, Doerre teaches wherein assigning the subject 
category follows an auto-categorization rule (column 5, lines 55-67; column 6, lines 1-3; column 
14, lines 57-67; column 15, lines 1-20). 

-In regard to dependent claim 15, as shown above, Doerre teaches wherein the 
document could come from a multitude of documents over the Internet (column 1, lines 18-24; 
column 4, lines 30-32). Doerre does not specifically teach wherein the initial document format 
had to be converted to one of the common document format to be processed. Weiser et al teach 
converting a document format (email message) from an email format to a common generic 
format (column 12, lines 53-55). It would have been obvious to one of ordinary skill in the art at 
the time of the invention for Dorre to have converted a document's initial format to a common 
document format, because Weiser et al taught that by doing so the common format can be 
understandable by the document system (column 12, lines 44-56: i.e. converting a document to a 
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format able to be processed by the a specific system provides the obvious advantage of being 
able to process the document in that system). 

-In regard to dependent claim 16, Doerre teaches separating the text content from said 
initial format document for categorizing documents based on text analysis techniques (column 5, 
lines 55-65: "extracting for each of said unprocessed documents its features"; column 9, lines 5- 
67; column 10, lines 1-67; column 1 1, lines 1-61). As shown above in dependent claim 15, 
Doerre does not teach converting the initial document format into a common document format. 
Weiser et al teach converting a document format (email message) from an email format to a 
common generic format (column 12, lines 53-55). It would have been obvious to one of ordinary 
skill in the art at the time of the invention for Doerre to have converted its initial format 
document to one of the common document formats listed above, because Weiser et teach by 
doing so the common format can be understandable by the document system (column 12, lines 
44-56)(i.e. converting document to a format able to be processed by the a specific system 
provides the obvious advantage of being able to process the document in that system), wherein it 
would have also been obvious to incorporate the text from the initial document into the said 
common document, because Doerre teaches the textual content was what was needed to 
categorize the incoming documents (column 9: Section "Text Analysis Functions"). 
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-In regard to dependent claim 21, Doerre teaches wherein the conceptual model 
includes a concept association dictionary (column 13, lines 1-24: "tree constructed... contains the 
complete clustering information including all inter- and intra-cluster similarities"). 

8. Claims 3, 9, 10, and 17 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Doerre et al (US-6 5 446,061 09/03/02) in view of Weiser et al (US-5,982,507 1 1/09/99) in view 
of in further view of Chakrabarti et al (US-6,4 18,433 07/09/02) in further view of Russell-Falla 
etal(US-6,675,162 01/06/04). 

-In regard to dependent claims 3, 9, and 17, Doerre extracting features from a 
document and establishing a plurality of thresholds to be associated with said features (column 
13, lines 24-54). Doerre does not specifically teach calculating a concept weight for one of said 
two or more concepts using a plurality of feature weights associated with said plurality of 
features wherein said concept weight represents a recognition confidence level for one of said 
two or more concepts and comparing said concept weight with a predetermined thresholds. 

Russell-Falla et al teaches calculating a concept weight/confidence level for one of said 
two or more concepts ("calculating a rating of the page")(column 3, lines 54-57) using a plurality 
of feature weights ("requires a weighting be provided for each word of phrase in the 
list")(column 3, lines 46-57) associated with said plurality of features ("regular 
expressions")(column 2, lines 55-59; column 8, lines 9-19) wherein said concept weight 
represents a recognition confidence level for one of said two or more concepts (column 3, lines 
54-57) and comparing said concept weight with a predetermined thresholds (column 2, lines 64- 
67; column 3, lines 1-16). It would have been obvious to one or ordinary skill in the art at the 



Application/Control Number: Page 9 

09/815,591 

Art Unit: 2178 

time of the invention for Doerre to have calculated a weighting for the unprocessed documents 
feature values to be compared to threshold values as taught in Russell-Falla, because Russell- 
Falla taught that through user selectable threshold values a documents relevance could more 
easily tailored to a specific user (column 2, lines 64-67; column 3, lines 1-16). 

-In regard to dependent claim 10, Doerre teaches setting specific threshold values for 
concept inclusion into the conceptual model at a plurality levels based on exceeding and or 
meeting certain thresholds (column 13, lines 1-55: "set of singleton clusters each containing a 
single documents. . .allows the user to set. . .thresholds of intra-cluster similarity") 

9. Claim 6 is rejected under 35 U.S.C. 103(a) as being unpatentable over Doerre et al (US- 
6,446,061 09/03/02) in view of Weiser et al (US-5,982,507 1 1/09/99) in view of in further view 
of Chakrabarti et al (US-6,4 18,433 07/09/02) in further view of W3C's, "Extensible Markup 
Language (XML) 1.0", 02/10/98, pp. 1-2, http://www.w3.org/TR/1998/REC-xml-19980210. 

-In regard to dependent claim 6, Doerre teaches wherein the document could come 
from a multitude of documents over the Internet (column 1, lines 18-25; column 4, lines 30-32). 
Doerre does not specifically teach wherein a common format was an XML document. W3C 
teaches wherein using XML was notoriously well known in the art for web applications (pp. 2: 
Section 1.1). It would have been obvious to one of ordinary skill in the art at the time of the 
invention, for one of the common formats of Doerre to have been XML, because W3C teaches 
that the XML format provides the benefits of being easy to create, being easy to write programs 
which process XML documents, and being human-legible and reasonably clear (pp. 2: Section 
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I. 1). It was also notoriously well known in the art at the time of the invention that XML was an 
International document standard. 

Response to Arguments 

10. Applicant's arguments with respect to claims 1, 7, and 14 have been considered but are 
moot in view of the new ground(s) of rejection. 

Conclusion 

I I . The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

Please note the additionally cited prior art on the accompanying PTO-892 form. 

12. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Adam L. Basehoar whose telephone number is (571)-272-4121. 
The examiner can normally be reached on M-F: 7:00am - 4:00pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Steve Hong can be reached on (571) 272-4124. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Statusinformation for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 

Adam L. Basehoar 




